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Abstract 

Halogen bonds (XBs) are essential non-covalent interactions in molecular recognition 
and drug design. Current studies on XBs in drug design mainly focus on the 
interactions between halogenated ligands and target proteins, lacking a systematic 
study on naturally existing and artificially prepared halogenated residue XBs (hr_XBs) 
and their characteristics. Here, we conducted a computational study on the potential 
hr XBs in  proteins/peptides using database searching, quantum mechanics 
calculations, and molecular dynamics simulations. XBs at protein-peptide interaction 
interfaces are found to enhance their binding affinity. Additionally, the formation of 
intramolecular XBs (intra XBs) within proteins may significantly contribute to the 
structural stability of structurally flexible proteins, while having a minor impact on 
proteins with inherently high structural rigidity. Impressively, introducing halogens 
without the formation of intra XBs may lead to a decrease in protein structural 
stability. This study enriches our comprehension of the roles and effects of 


halogenated residue XBs in biological systems. 
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1. Introduction 


In recent decades, it was confirmed that halogen atoms have the capabilities similar to 
hydrogen atoms when interacting with Lewis bases, which was called halogen bond 
(XB)!. The anisotropic charge distribution surrounding covalently bonded halogen 
atoms gives rise to a positively charged electrostatic region recognized as the o-hole*“, 
which forms along the extension of C—X bonds (X = Cl, Br, I) and interacts 
attractively with the nucleophile**. The strength of the XB relies on the size of the 
o-hole, which corresponds to the trend of halogen polarizability [Cl < Br < I]’. 
Additionally, as the strength of the XB increases, their geometric shapes become more 
ideal (shorter bond lengths with angles closer to linear arrangement). Due to their 
critical role in the structure and function of molecular interactions in biological 
systems, XBs have attracted more and more interest in rational drug design*!. 

In biological systems, the majority of research about XBs concentrate on the 
interactions between small organohalogens and proteins. A comprehensive analysis of 
structural data available in the Protein Data Bank (PDB) has illuminated potential 
interaction partners for halogenated ligands in the proteins*. In ligand-protein 
complexes, common XB donors are organohalogens, whereas the dominant acceptors 
consist of carbonyl oxygens in the protein backbone® 13-15, The strength of the 
organohalogen XB can be additionally modulated by the electron-withdrawing 
capacity of substituents covalently bonded to a given scaffold, and the core scaffold 
that the halogen is attached to also play a role!® !3 16., Numerous pharmaceutical 
ligands incorporate halogen atoms into their molecular structures, aiming to optimize 
the membrane permeability and metabolic stability of small molecules'’. Based on a 
comprehensive database survey of Thomson Reuters Pharma and ZINC, Xu et al. 
noted that around one-third to one-quarter of compounds at different stages of drug 
discovery or launch are organohalogens, and halogenation may significantly 
contribute to improving protein—ligand binding affinity as well as ligand ADME/T 
properties!®. 

When it comes to proteins/peptides, it’s reported that naturally existing 


halogenated amino acids have an impact on both the structure and function of these 


proteins. For instance, halogenated tyrosine and tryptophan in marine organisms have 
been found to promote protein cross-linking!’’, while chlorotyrosine and 
bromotyrosine produced through oxidative halogenation are biomarkers for human 
asthma’’. In addition, there are artificially modified halogenated proteins/peptides. It 
has been reported that the introduction of halogenated residue XBs can modify 
enzyme thermal stability, activity, and substrate selectivity *!3. Furthermore, the 
work of the Máté Erdélyi group suggests that the introduction of XBs can alter local 
protein folding”*. 

To the best of our knowledge, no systematic study has been performed on the 
roles of halogenated residue XBs (hereinafter hr XBs) in proteins/peptides. In this 
study, we conducted an analysis on the hr XBs in proteins/peptides with PDB*, 
followed by quantum mechanics (QM) calculations or molecular dynamics (MD) 
simulations, to explore the roles of hr_XBs. The results show that the intermolecular 
halogenated residue XBs between proteins and peptides can enhance their binding 
affinity. In addition, halogens within proteins have diverse impacts on protein 
structural stability, depending on both the formation of intramolecular halogenated 
residue XBs and the protein's inherent structure features. The results could be useful 
for researchers to utilize hr XBs in halogenated peptide drug design and protein 


engineering. 


2. Methods 

2.1 Structure Survey for Halogenated Protein 

The PDB ” database (September 13, 2022 release) was used to explore the hr_XBs in 
proteins/peptides. Considering the long-range nature of electrostatic interactions and 
the fact that XBs are underestimated in PDB structures**”’, both the typical tŝand 
looser geometry criteria of XBs were applied for detecting (potential) hr XBs. The 
distance and angle of the looser criteria were defined to be 1 A longer than the sum of 
van der Waals radii (Table S1) and larger than 120° (a < 70° and 0 > 110° for the 
C-X:--m type of XBs), respectively. 


2.2 QM/MM Optimization 
QM/MM was performed to optimize the proteins/peptides systems using the two-layer 
ONIOM (our own N-layered integrated molecular orbital and molecular mechanics) 


1° as implemented in Gaussian 16°!. The key residues for proteins/peptides 


mode 
involved in hr XBs were included in the QM region, whereas the rest were contained 
in the MM region. The QM region was described at the M06-2X/6-31g(d) level??? for 
all atoms (LanL2DZ basis set** in the case of iodine) and the MM region was 


described by the AMBER parm96 force field*>. 


2.3 Halogen Bonding Analysis 

We conducted a Natural Bond Orbital (NBO) analysis to investigate the 
intermolecular electron transfer during the formation of hr XBs. Additionally, the 
region, size and species of the weak interaction of hr XBs between the proteins and 
peptides were studied by the Independent Gradient Model (IGM) analysis***’. The 
analyses of molecular electrostatic potentials (MEPs) for halogenated residues, NBO 
and IGM were undertaken by the Multiwfn program**’, based on the obtained wave 
functions at M06-2X/6-311+G(d,p) level’. As for proteins, the adaptive 
Poisson-Boltzmann equation solver (APBS)*° was used to compute the MEPs. All the 


analyses mentioned above were conducted based on QM/MM optimized structures. 


2.4 Calculation of the Binding Energy 

In order to evaluate the binding strength of halogenated and non-halogenated peptides 
to proteins, single point energy was calculated at M06-2X/6-311+G(d,p) level 
(LanL2DZ basis sett in the case of iodine) and the binding energy between peptides 
and proteins were then assessed by eq 1. 

AE = Eom — Eresper — Erespro + BSSE (eq 1) 

where AE is the binding energy, Ecom is the energy of the whole complex in the QM 
region, E,,,rep and E,espro are the energies of the key residues in peptide and protein, 
respectively, the basis set superposition error (BSSE) are considered in the binding 


energies calculation". 


2.5 Extra Point (EP) Model 

To account for intramolecular halogenated residue XBs in MD simulation, 
halogenated residues in proteins were parametrized using an EP model*?*3. The 
structural optimization and electrostatic potential (ESP) calculation of halogenated 
residues were carried out at the HF/6-31G(d) level for all atoms (LanL2DZ basis set** 
in the case of iodine). An EP was then introduced to attach to the halogen atom along 
the axis of the C-X bond at a distance ranging from 1.00 to 2.50 A with an increment 
of 0.01 A, maintaining a fixed C—X---EP angle of 180°. The partial atomic charges 
were subsequently obtained by applying the Restrained ElectroStatic Potential (RESP) 
method using the Multiwfn software*®. The selected X---EP distance was determined 
based on the best fit to the QM electrostatic potential, corresponding to the minimum 
relative root mean squared error (RRMSE) values. In addition to the X--:-EP distance, 


we employed EP parameters similar to those reported in previous studies (Table $2)”. 


2.6 MD Settings 

Each protein system was solvated into a periodic TIP3P water box, extending 10 A 
from the solute, and was neutralized by placing a rational number of counterions of 
Na* or Cl. The topology of protein system was generated using amber ff14SB force 
fields. About 8000 steps of minimization including 6000 steps of steepest descent 
minimization and 2000 steps of conjugate gradient minimization were performed to 
remove bad contacts during the energy minimization phase. The equilibration phase 
was then carried out separately in the NVT and NPT ensembles, each for 20 ps. 
Finally, the 150 ns MD production simulation was performed at 300 K, 1 bar, using a 
Langevin thermostat’ and a Nosé-Hoover Langevin barostat*>*° to control the 
temperature and pressure. Bonds involving hydrogen atoms were fixed by the 
SHAKE algorithm*’. The cutoff distance applied for van der Waals interactions was 
12 A. The particle mesh Ewald method (PME) was used to deal with long-range 
electrostatic interactions. All MD simulations were performed using the GROMACS 


software package, version 2019.648. Three simulation replicates were run for each 


system. 


2.7 MD Analysis 

The root-mean-square deviation (RMSD) values were calculated for heavy atoms 
using initial structure as the reference. During the equilibrium period, the analysis of 
the root-mean-square fluctuation (RMSF) for the combined 3 trajectories was 
performed to evaluate the flexibility of the proteins. The radius of gyration (Rg) was 
calculated as a measure of the size and compactness for proteins. To analyze the effect 
of halogenation on protein structural stability, we used the software CONtact Analysis 
(CONAN)”’ to calculate the local interaction time (LIT) for individual protein 
residues of the combined trajectories. MDAnalysis°°>! was used to calculate the 


occurrence of intramolecular halogenated residue XBs from the trajectories. 


3. Results And Discussion 


3.1 PDB Survey 

Among the 195,093 structures in PDB, there are a total of 423 structures of 
proteins/peptides containing 2430 halogen atoms, 949 of which are heavy halogen 
atoms (Cl, Br and I). With typical XB criteria, 84 heavy halogen atoms (8.9%) were 
found to form 101 hr XBs. However, if the looser criteria were applied, 295 heavy 
halogen atoms (31.1%) may form 532 potential hr XBs, indicating that the hr XBs 
might be highly underestimated in PDB structures. Among the 532 potential hr_XBs, 
350 intermolecular halogenated residue XBs were found between the proteins and the 
halogenated peptides (hereinafter inter XBs), while 182 intramolecular halogenated 
residue XBs were identified within the proteins (hereinafter intra XBs). The 
distribution of halogen atoms and potential hr_XB in the 949 heavy halogen atoms are 
shown in Figure la. Compared with chlorine (35.6%) and bromine (21.0%), iodine 
(43.4%) is the most popular halogen atom in proteins/peptides. As shown in Figure 1b, 
considering previous literature, both organohalogen XBs and hr_XBs demonstrate a 


preference for sulfur atom as acceptor. However, for hr XBs, side chain atoms 


(nitrogen and oxygen) exhibit a higher propensity to act as acceptors compared to the 
main chain atoms, and this trend is not pronounced in organohalogen XBs*’, which 
may due to the preference of atom types around halogens in proteins/peptides (see 


Figure S1). 
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Figure 1 Statistical results of halogenated proteins/peptides. (a) Distribution of 
halogen atoms (excluding F) and potential hr XB formations. (b) The elements 
propensity for hr XBs (X:--Y type). Propensity is defined as the relative frequency of 
each element in hr XBs divided by its relative background frequency in the refined 
dataset (Identical hr XBs occurring in the same system have been removed). 


“No halogenated residue XB formed 


3.2 The Strength of Intermolecular halogenated residue XBs (inter_XBs) 

In order to explore the inter XBs between proteins and peptides, we selected four 
systems for further QM/MM calculation. As evidenced by the results presented in 
Table 1, the optimized structures conform to the typical geometry criteria for XBs and 
have a better linear contact than their PDB structures (the geometries of the Cl system 
1OKW are provided in Table S3). The geometries of the protetn—peptide complexes 
after QM/MM optimization are shown in Figure 2a. It can be found that the XB 
regions change greatly after QM/MM optimization, which is different from the 
optimization results of halogenated ligand-protein systems**. However, the overall 
structural changes in the complex (including host protein and peptide) after 


optimization are minimal (Figure S2a), suggesting that the significant changes in XB 


region might be caused by the relatively high flexibility of halogenated residues side 
chains. Furthermore, our research reveals that most of the non-XB interactions (HB, 
ionic, cation-x interaction, etc.) observed in crystal structures are preserved in 
optimized structures, with additional non-XB interactions formed after optimization 
(Figure S2b). Studies on the optimized structures of halogenated and non-halogenated 
complex suggest that the non-XB interactions may positively influence the XB region, 
while halogens still remain the primary contributors to the XB formation (Figure S3). 
The results of QM/MM optimization indicate that the inter_XB present at the 
protein-peptide interface in PDB structure can be well recovered (The details of XBs 


region for the systems calculated here are provided in Table S8). 


Table 1 Protein-peptide complexes selected for QM/MM optimization 
PDBID XB mode Crystal QM/MM 


d(X-¥), Å Z(C-X+-Y),deg  d(X--Y), Å Z(C-X+Y),deg 


SITF* =C-Br-:-O 4.37 146.4 3.03 170.6 
1CZI C-I::-O 3.94 157.1 2.96 157.6 
1GA4 C-I::-O 3.47 104.5 2.99 150.2 


a Calculation results for chains A and E (cAE) of the 5ITF system. The results for 


chains B and E (cBE) and chains D and F (cDF) are provided in Table S6. 
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Figure 2 Structural characteristics and MEP of the XB region. (a) Structure details of 
the 3 systems before and after QM/MM optimization. The PDB structures are shown 
in gray, while the QM/MM optimized structures are illustrated in cyan. Interaction 
distances (A) and angles (deg) for the PDB structures are presented in magenta, 
contrasting with the blue annotations for the optimized structures. (b) The MEP of the 
protein region located around the halogenated residues. (c) The MEP maps of the 


halogenated amino acid residues. 


The MEP maps of the residues involved in the inter XBs are shown in Figures 
2b and 2c. The red areas correspond to regions with negative potentials, indicating the 
nucleophilic sites, while the positive potentials regions described by the blue areas, 
indicating the electrophilic sites. The observation of positive electrostatic potential 
regions, known as o-holes, on the molecular surface of halogen atoms in all studied 
halogenated residues, along with the presence of negative electrostatic potential 
regions on the surface of the oxygen atoms of the XB acceptors, demonstrates the 


favorable electrostatic complementarity between the halogen atoms and the acceptor 


atoms. 

NBO analysis was conducted to explore the intermolecular electron transfer 
involved in the formation of inter_XBs. Table 2 presents the second-order 
perturbation energy (AE(2)) values between the inter XB donors and acceptors in 3 
systems. The NBO analysis results show that electrons transfer from the inter XB 
acceptor's lone pair electron orbital of the oxygen atom to the donor's antibonding 
orbital of the carbon—halogen bond. Such an electron transfer phenomenon is 
consistent with the basic profile of XBs that electron always transfer from the XB 


acceptors to the XB donors.>? 


Table 2 Natural bond orbitals and the E(2) results 


PDB ID E(2)sum* E(2)° Electron donor orbital Electron acceptor orbital 


SITF 2.86 1.33 LP(1)O BD*(1)C-Br 
0.51 LP(2)O 
1.02 LP(3)O 

1CZI 2.38 2.18 LP(1)O BD*(1)C-I 
0.20 LP(2)O 

1GA4 4.40 1.79 LP(1)O BD*(1)C-I 
0.99 LP(2)O 
1.62 LP(3)O 


“The sum of second-order perturbation energies 


b Second-order perturbation energy 


IGM analysis was carried out to explore the nature and strength of the weak 
interaction of the inter XBs in the 3 complex systems. The type of noncovalent 
interaction can be characterized by the sign of the second largest eigenvalue of the 
electron density Hessian matrix (sign (A2)). A negative value of sign (A2) x p indicates 
an attractive interaction, while a positive value suggests a repulsive interaction. A 


value of zero corresponds to a van der Waals interaction. The IGM isosurfaces for 3 


selected systems are displayed in Figure 3. The blue region in the center of the 
isosurface indicates the strong inter_XB interaction formed between the halogen atom 


and the acceptor atom. 
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Figure 3 The IGM analysis and sign(A2)p colored isosurfaces of 5gi"* = 0.01 a.u. for 
the selected systems (a) 5ITF, (b) 1CZI and (c) 1GA4 with ranging from -0.05 to 0.05 
au. The regions participating in inter XBs are displayed in a ball-and-stick format, 
where bromine atoms are rendered in brown and iodine in pink. The repulsive 
interaction regions are visualized in red, the weak interaction regions in green, and the 


strong attraction regions in blue. 


The analyses presented above, considering both the geometric and quantum 
chemical features of the XBs, indicate the formation of inter_XB interactions between 
proteins and peptides. Furthermore, we calculated the binding energy for these 
inter_XBs to investigate whether halogenation can enhance the binding affinity. For 
each system, the halogen atom (X) in the halogenated peptide was replaced with 
hydrogen atom (H), QM/MM optimization was performed on it, and the energy 
difference (AAE) was calculated. As shown in Table 3, the AAE values calculated in 
vacuum for all systems are negative, indicating that the presence of inter_XB 


enhances the protein—peptide binding interactions (the results calculated in 


chloroform are shown in Table S4). 


Table 3 The binding energies of optimized structures for 3 systems in vacuum 


(kcal/mol) 
System AEx? AEx? AAE* 
S5ITF4 -5.39 -1.92 -3.47 
1CZI -2.33 -1.42 -0.91 
1GA4 -4.21 -0.81 -3.40 


“ Binding energy of protein and halogenated peptide. The energies calculated for the 
crystal structures are shown in Table S5. 
b Binding energy of protein and non-halogenated peptide. 
€ For each system: AAE = AEx - AEn. 
d Calculation results for chains A and E (cAE) of the 5ITF system. The results for 


chains B and E (cBE) and chains D and F (cDF) are provided in Table S6. 


3.3 Intramolecular Halogenated Residue XBs (intra_XBs) and Its Effect on the 
Structural Stability of Halogenated Protein 

To implement the EP model in MD simulations, it is necessary to extract the optimal 
X: EP distances from two halogenated residues (4BF and TYR), which can best 
reproduce the ESP calculated by QM. As shown in Figures 4a and 4c, the optimal 
X- EP distances for Br (4BF) and I (IYR) are determined to be 1.37 A and 1.73 A, 
respectively, based on the minimum RRMSE values. Under these optimal X--:-EP 
distances, the EP model accurately reproduces the anisotropic nature of the ESP 
around the halogen atoms, with a o-hole well evident on the halogen atoms (Figures 


4b and 4d). 
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Figure 4 Distances of EP and MEP of halogenated residues (a, c) The X...EP distance 
fitting results of 4BF and TYR. (b, d) Electrostatic potential mapped onto the 0.001 a.u. 
electron density isosurface of 4BF and IYR using various methods (SP: single point 


charges). 


Using the EP parameters described above, MD simulations were performed 150 
ns each for the 3 systems (2ZXV, 3VN3, and 5KI8) to illustrate the structural stability 
upon halogenation. Notably, in the halogenated 2ZXV system (2ZXV_]), the average 
Rg measures 16.41 A, while in the non-halogenated counterpart (2ZXV_noX), it 
registers at 16.88 A (Figure 5), suggesting that 2ZXV_I is more stable than 
2ZXV_noX. For the 3VN3 system, analysis of Rg indicates minimal differences 
between 3VN3_noX (16.81 A) and the halogenated one (16.79 A). In the 5KI8 system, 
5KI8_Br has an average Rg of 17.35 A, while that of non-halogenated is 16.70 A. In 
summary, the MD analyses indicate that halogenation had different effects on protein 


structural stability: enhancing structural stability in 2ZXV, having little impact in 


3VN3, and decreasing structural stability in 5KI8. 
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Figure 5 Structural stability of the halogenated (X) and non-halogenated (noX) 
proteins for (a) 2ZXV, (b) 3VN3 and (c) 5KI8. Left column shows the Rgs of the 
three systems, while the distribution of Rgs during the stable time period and the 
mean values were shown in the right column (The common stable time period was 
selected for the trajectories before and after halogenation based on the time-dependent 
changes in RMSD as shown in Figure S4). The other two repeated simulation results 


for each system are shown in Figures S5 and S6. 


The MD results across the three systems reveal that introducing halogens yields 
varying impacts on protein structural stability. To uncover the underlying reasons for 
these diverse structural stability changes induced by halogens, we conducted further 
investigations. As protein structural stability is linked to weak intramolecular 
interactions, the introduction of intra XBs through halogen incorporation may 
increase protein structural stability™*. Therefore, we analyzed the intra XB formation 
from the combined trajectories of three systems and found that XBs were detected in 
2ZXV_I and 3VN3_JI, but not in 5KI8_Br system. The free energy landscapes (FELs) 
of 2ZXV_I and 3VN3_I trajectories are shown in Figure 6, exhibiting a single free 
energy minimum centered at relatively short I---O distances and interaction angles of 
~146° and ~168° respectively. We further performed QM/MM optimization on the 
structures of the lowest free energy, and the optimized intra XB regions formed a 
better linear contact (Figures 6c and 6f). These suggest that in the 2ZXV and 3VN3 
systems, the conformations that form intra XB are the dominant structures (The 
details of XBs region for the systems calculated here are provided in Table S8). 
Furthermore, in 2ZXV_I and 3VN3_I, the halogen is positioned meta to the tyrosine. 
In 150 ns trajectories, we observed the formation of a certain proportion of 
intramolecular HBs (with the hydrogen on the hydroxyl group of tyrosine acting as 
HB donor and halogen acting as HB acceptor), which can enhance the strength of the 
XBs (Figure S7 and Table S7). As for the 5KI8 system, due to the atoms that can 
serve as XB acceptors were far away from the halogen (Figure S8), and no intra XB 
was detected in the MD trajectories, we believe that there is no intra XB formed in 


the 5KI8 system. 
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Figure 6 Dominant conformations of 2ZXV_I and 3VN3_I. (a, d) FELs obtained 
from the simulations of 2ZXV_I and 3VN3_I along with (b, e) the corresponding 
lowest energy structures at 300 K. FEL was calculated using the bond length and bond 
angle formed by the atoms related to the intra XB as the reaction coordinates. The 
values are given in kcal/mol as indicated by the color bars. (c, f) QM/MM 


optimization results of the lowest energy structures obtained from the MD trajectories. 


We wondered whether the formation of intra XBs in proteins might also lead to 
increased other contacts. Therefore, we performed LIT (local interaction time) 


analysis for each system. Residue with higher LIT value typically demonstrates a 


greater degree of contact with surrounding residues. The difference in local 
interaction time (ALIT, LITx - LITnox) was then calculated on the combined 
trajectories within the stable time period to explore whether protein halogenation 
affects the degree of other residues’ contacts and thus protein stability. In the case of 
the 2ZXV system (Figure 7a), residues within a 6 A radius around the halogenated 
residue **'TYR (residue 26-42) showed ALIT values greater than 0, with a total ALIT 
sum of 12.28%. This suggests that 2ZXV_I has a larger overall LIT compared to 
2ZXV_noX, indicating that the introduction of halogen atoms enhances the LIT 
within the 2ZXV system. For the 3VN3 system, ALIT values exhibit minor 
fluctuations around the 0 value, and when combined with statistical analysis results 
(Figure 7c), it can be inferred that there is relatively little difference in the overall LIT 
before and after halogenation. In contrast, in the 5KI8 system, statistical analysis 
reveals a reversed trend for ALIT (Figure 7e), with a total ALIT sum of -4.65%. This 
indicates that the introduction of halogen atoms decreases the LIT within the 5KI8 
system. RMSF analyses demonstrated that the introduction of halogen atoms had a 
noticeable impact on the fluctuations of residues in the 2ZXV and 5KI8 systems 
(Figures 7b and 7f), while its effect on the 3VN3 system was minimal (Figure 7d), 


possibly due to the inherent structural rigidity of 3VN3. 
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Figure 7 Local characteristics of residues in 2ZXV, 3VN3 and 5KIB8. (a, c, e) ALIT 
among the three systems. The number of residues with ALIT > 0 or ALIT < 0 and the 
sum of ALIT > 0 or ALIT < 0 are labeled in the embedded tables. (b, d, f) The RMSF 


for the Ca atoms among the three systems. 


The MD simulation results suggest that the introduction of halogen atoms to 
form intra XBs appears to enhance the protein structural stability (as observed in 
2ZXV). Conversely, the introduction of halogen atoms without the formation of 
intra XBs may negatively impact the formation of residue contacts, leading to a 


decrease in protein structural stability (5KI8). Additionally, the structural 


characteristics of the protein itself may also be a contributing factor. For structurally 
rigid proteins (such as 3VN3), the structural impact on stability is relatively small 


even when intra_XBs are formed. 


4. Conclusion 


In this work, we made a systematic survey on the PDB database for the presence of 
halogenated residue XBs (hr_XBs) in proteins/peptides and found that the hr XBs 
involved in proteins/peptides could be classified into 2 cases: one is the 
intermolecular halogenated residue XBs (inter _XBs) at the protein-peptide binding 
interface, the other is the intramolecular halogenated residue XBs (intra _XBs) within 
the proteins. Statistical results show that the hr XBs are more prone to utilizing side 
chain atoms (nitrogen and oxygen) as XB acceptors than main chain atoms, whereas 
this trend is not notably pronounced in organohalogen XBs. Compared with 
halogenated ligands, the proteins/peptides have more space to form strong XBs after 
QM/MM optimization due to the flexibility of residues. Regarding the protein-peptide 
complexes (SITF, 1CZI, 1GA4), strong inter_XB interactions are discovered in all 
three systems and can enhance the binding affinity of the protein-peptide complexes. 
For the proteins with halogenated residues (2ZXV, 3VN3, 5KI8), halogens have 
different impacts on protein structural stability, depending on the formation of 
intra_XBs and the structural characteristics of proteins. In detail, the introduction of 
halogen atoms to form intra XBs appears to enhance the protein structural stability. 
However, the rigidity of the structure can mask the stabilizing effects brought about 
by the intra XBs. In comparison, the introduction of halogens that cannot form 
intra XBs may reduce the structural stability of the protein. The findings above 


provide a potential guide for protein engineering and drug design. 
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